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(57) Abstract: A method for improving the spatial perception of multiple sound channels when reproduced by two loudspeakers, 
generally front-located with respect to listeners, each channel representing a direction, applies some of the channels, such as sound 
channels representing directions other than front directions, to the loudspeakers with headphone and crosstalk cancelling processing, 
and applies the other ones of the sound channels, such as sound channels representing front directions to the loudspeakers without 
headphone and crosstalk cancelling processing. The headphone processing includes applying directional HRTFs to channels applied 
to the loudspeakers with headphone and crosstalk cancelling processing and may also include adding simulated reflections and/or 
artificial ambience to channels applied to the loudspeakers with headphone and crosstalk cancelling processing. 
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DESCRIPTION 
Method for Improving Spatial Perception in Virtual Surround 

5 

TECHNICAL FIELD 

The invention relates to audio signal processing. More particularly, the 
invention relates to improving the spatial perception of a multichannel sound source 
when reproduced by two loudspeakers. 

10 

BACKGROUND ART 
Multichannel sound reproduction systems such as Dolby Pro Logic or 
Dolby Digital (Dolby, Dolby Pro Logic and Dolby Digital are trademarks of Dolby 
Laboratories Licensing Corporation) require, for example, five speakers, placed at 
15 particular locations and particular angles. This can be costly and space consuming. It 
would be desirable to have surround sound without rear loudspeakers, to save on cost 
and space. However, conventionally, front loudspeakers only provide front sound 
images. 

It is known to process multiple channels representing sounds from many 
20 directions, and combine them into two signals for reproduction over headphones, 
retaining the apparent multiple directions. With headphone reproduction the left 
signal goes to the left ear, and the right to the right, with no crosstalk. Sounds can 
appear to come from the sides of die listener as well as from die front, or in some 
cases the rear. 

25 Considering each of the multichannel inputs as representing sound from a 

particular direction, such processing for headphones typically includes at least 
applying appropriate HRTFs (head related transfer functions) to each input to 
simulate the paths from its desired apparent direction to the two ears, so that die 
headphone listener perceives each channel as coming from the desired direction. 

30 Such headphone processors, which provide two outputs in response to more'than two 
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inputs, are referred to by a variety of names such as "multi-axis binaural steering" 
processors, "multi-channel binaural synthesizers", "headphone virtual surround" 
processors, and the like. Some headphone processors also provide processing in 
addition to applying directional HRTFs, such as adding simulated reflections and/or 
5 artificial ambience to one or more of the channels. All such processors, whether 
employing only directional HRTFs or also additional processing, such as artificial 
reflections and/or ambience, are referred to herein as "headphone processors." Some 
examples of headphone processors include those described in published International 
Application WO 99/14983 (designating the United States) and in U.S. Patents 

10 5,371,799; 5,809,149; and 6,195,434 Bl. Each of said application and patents are 
hereby incorporated by reference, each in their entirety. 

Conventional two-channel stereophonic material is intended for 
reproduction over two loudspeakers. Each of the listener's ears receives sound from 
both loudspeakers, with, of course, different path lengths and frequency responses. 

15 In other words, there is acoustic crosstalk. In general, all sounds so reproduced 
appear to lie within the space between die loudspeakers. 

It is also known to modify signals prior to application to two loudspeakers to 
cancel the acoustic crosstalk, at least partially. This allows the apparent position of 
sounds to lie well outside the space between the loudspeakers, and is the basis of 

20 "virtual surround" processes. To the extent that the crosstalk is cancelled, the sounds 
entering the ears from the two loudspeakers resemble those provided by headphones, 
i.e., without crosstalk. Crosstalk cancellers (sometimes referred to as "spatializers" 
or "panoramic processors") are well known in the art, dating at least from U.S. Patent 
3,236,949 (Atal and Schroeder), which patent is hereby incorporated by reference in 

25 its entirety. A computer-software-implemented acoustic-crossfeed canceller using 
very low processing resources of a personal computer is disclosed in U.S. Patent 
Application S.N. 08/819,582 of Davis et al, filed March 14, 1997, which application 
is hereby incorporated by reference in its entirety. 

As is also known, signals representing multiple channels, including sounds 

30 originally coming from outside the space between die loudspeakers can be processed 
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as if for reproduction over headphones and then fed via an acoustic crosstalk 
canceller to two front loudspeakers arranged in a conventional stereo configuration, 
such as at the sides of a computer monitor or a television picture tube. This 
combination of headphone processing and crosstalk cancellation allows the apparent 
position of sound sources to lie to the sides, or in some cases the rear, using only a 
pair of front loudspeakers. 

FIG. 1 is a schematic block diagram showing a prior art arrangement in 
which the multiple channels of a multichannel source, such as a five-channel source 
(each channel representing a direction, such as left front, center front, right front, left 
surround and right surround), are applied to a headphone processor 2. The two 
outputs of the headphone processor are applied to a crosstalk canceller 4, which also 
has two outputs. One output of the crosstalk canceller is applied to a first 
loudspeaker 6 and the other output is applied to a second loudspeaker 8. 

The combination of headphone processing and crosstalk cancellation 
feeding a pair of loudspeakers is superior to a crosstalk canceller alone because the 
processing for headphone reproduction introduces additional directional cues by 
introducing directional HRTFs (crosstalk cancellers may include only "one ear to the 
other" HRTFs) and, in some headphone processors, simulated multiple acoustic paths 
(including reflections) between apparent image positions (outside the loudspeakers) 
and the listener's ears. Thus, with combined headphone processing and crosstalk 
cancellation, virtual sound images may appear not only at die sides of a listener's 
head but also from further back. 

However, there are disadvantages of such a combined headphone 
processing and crosstalk cancellation scheme. The front sound channels (left front 
center front, right front) of die multichannel source are intended to be reproduced 
over loudspeakers and are satisfactorily reproduced by two loudspeakers that 
reproduce die left front and right front channels and also provide a virtual or 
"phantom" center front image (provided, of course, that the listener is appropriately 
located with respect to the two loudspeakers). Consequendy, processing the front 
sound channels is not necessary and should be avoided (in accordance with the "least 
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treatment" principle). Headphone processing of the front channels involves at least 
the application of directional HRTFs that may cause colorations or changes in timbre, 
for example. Other headphone processing techniques, for example the simulation of 
reflections or reverberation, may introduce other noticeable and unnecessary 
5 alterations of the front channel signals or may produce artifacts. Crosstalk 

cancellation may also adversely affect the front channels. Crosstalk cancellation is 
most effective when the playback environment, the listening room, introduces little 
by way of reflections. Consequently, in practical "real listening room" applications, 
crosstalk cancellation is incomplete. Thus, even if headphone processing of the front 
10 channels were transparent, the subsequent crosstalk cancellation in prior art of the 
type shown in FIG. 1 would likely impair the reproduced front channel sound. 

In accordance with the present invention, impairment of the front 
channel reproduction is avoided while retaining the benefits of improved surround 
channel reproduction from a pair of loudspeakers. 

15 

BRIEF DESCRIPTION OF THE DRAWINGS 
FIG. 1 is a schematic block diagram showing a prior art arrangement in which 
the multiple channels of a multichannel source, such as a five-channel source (each 
channel representing a direction, such as left front, center front, right front, left 
20 surround and right surround), are applied to a pair of front-located loudspeakers via a 
headphone processor and a crosstalk canceller. 

FIG. 2 is an idealized functional block diagram of an arrangement in 
accordance with the present invention. 

25 DISCLOSURE OF THE INVENTION 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 
FIG. 2 shows an idealized functional block diagram of an arrangement in 
accordance with the present invention that receives the multiple channels of a 
multichannel source, such as a five-channel source (each channel representing a 
30 direction, such as left front (L), center front (C), right front (R), left surround (Ls) 
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and right surround (Ls)), applies the secondary channels (such as left surround and 
right surround) via a headphone processor and a crosstalk canceller to a pair of front- 
located loudspeakers and applies the main channels (such as left, center, right) to the 
pair of front-located loudspeakers without headphone or crosstalk cancelling 
5 processing. 

The source of the multidirectional sound sources applied to the arrangement of 
FIG. 2 is not critical and may be any suitable source, including, for example, a Dolby 
Pro Logic source, a Dolby Digital source, a Digital Theater Systems Corporation 
("DTS") source ("DTS" is a trademark), a discrete source, or some other source. 
10 Although the invention will be described in connection with an embodiment that has 
three main channels and two secondary channels, the invention is not so limited. For 
example, there may be only two main channels, such as left and right, and/or there 
may be more than two secondary channels, such as five secondary channels (for 
example, left front surround (LFS), left rear surround (LRS), right front surround 
15 (RFS), right rear surround (RRS) and center surround (CS)). The number of 

secondary channels is limited only by the complexity of the headphone processor and 
its ability to simulate the placement of sounds in a large number of directions. 

As shown in FIG. 2, a portion of the arrangement is a conventional prior art 
Dolby MP Matrix encoder configured as a 3:2 encoder. The matrix encoder 10 
20 accepts three separate input signals; left front, center front, and right front (L, C, R), 
and creates two final outputs, left total and right total (Lt and Rt). The C input is 
divided equally and summed with the L and R inputs with a 3 dB level reduction in 
order to maintain constant acoustic power. 

The left-total (Lt) and right-total (Rt) encoded signals may be expressed as 
25 Lt = L + 0.707 C; and 

Rt = R + 0.707 C, 

where L is the left front input signal, R is the right front input signal, and C is the 
center front input signal. When the Lt encoded signal is reproduced by a left-located 
front loudspeaker and the Lt encoded signal is reproduced by a right-located front 
30 loudspeaker, a virtual or "phantom" center channel image may be perceived by a 
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properly located listener. The use of a center channel is not critical and may be 
omitted, in which case the L and R input signals may be coupled directly to the 
loudspeakers without any requirement for a matrix to mix in the center channel. If an 
encoder matrix is employed, it need not mix in the center channel at -3 dB but may 
5 employ some other mixing level. In any case, in accordance with the present 
invention, the main channels intended for reproduction by two front-positioned 
loudspeakers (such as die left front, center front (if employed) and right front 
channels) are not applied to the two loudspeakers via a headphone processor and/or a 
crosstalk canceller. 

10 Still referring to FIG. 2, the left surround (Ls) and right surround (Rs) 

supplemental channel signals are applied to the left surround (Ls) and right surround 
(Rs) inputs of a headphone processor 12. Headphone processor 12 has characteristics 
such as described above. Such headphone processors may also have inputs for left 
front (L), center front (C) and right front (R) signals, as shown in FIG. 2; however, 

15 those inputs are not used. As explained above, there may be additional supplemental 
channel signals applied to headphone processor 12 provided that the device is 
capable of processing more than two secondary channel inputs. Headphone 
processor 12 provides two output signals, left headphone (Lh) and right headphone 
(Rh). These outputs are intended to provide a headphone listener with the perception 

20 that each of the secondary channel inputs is coming from the desired direction. The 
Lh and Rh output signals are not applied to headphones but to a crosstalk canceller 
14 that, in turn, provides crosstalk cancelled versions of the Lh and Rh signals, 
designated here as left canceller (Lc) and right canceller (Rc). The Lc signal is 
additively combined with the Lt signal in a summer 16 to produce a left virtual (Lv) 

25 and the Rc signal is additively combined with the Rt signal in a summer 1 8 to 

produce a right virtual (Rv) signal. The Lv signal may then be coupled to a suitable 
left-positioned front-located loudspeaker (not shown) and the Rv signal may then be 
coupled to a suitable right-positioned front-located loudspeaker (not shown). 
Reproduction of the Ly and Rv signals by such loudspeakers provides a properly 

30 located listener with the perception of main channel sounds without the shortcomings 
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of headphone processor and/or crosstalk canceller processing while providing 
enhanced phantom images of the secondary channel sounds. 

It should be understood that implementation of other variations and 
modifications of the invention and its various aspects will be apparent to those skilled 
5 in the art, and that the invention is not limited by these specific embodiments 
described. 

The present invention and its various aspects may be implemented in 
hardware, or as software functions performed in digital signal processors, 
programmed general-purpose digital computers, and/or special purpose digital 
10 computers, or as a combination of hardware and software functions. Interfaces 

between analog and digital signal streams may be performed in appropriate hardware 
and/or as functions in software and/or firmware. 
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CLA1MS 

1. A method for improving the spatial perception of multiple sound channels 
when reproduced by two loudspeakers, each channel representing a direction, 
comprising 

5 applying some of said channels to said loudspeakers with headphone and 

crosstalk cancelling processing, and 

applying the other ones of said sound channels to said loudspeakers without 
headphone and crosstalk cancelling processing. 

10 2. A method according to claim 1 wherein said two loudspeakers are generally 

front-located with respect to listeners and wherein sound channels representing 
directions other than front directions are applied to said loudspeakers with headphone 
and crosstalk cancelling processing and sound channels representing front directions 
are applied to loudspeakers without headphone and crosstalk cancelling processing. 

15 

3. A method according to claim 1 or claim 2 wherein said headphone 
processing includes applying directional HRTFs to channels applied to said 
loudspeakers with headphone and crosstalk cancelling processing. 

20 4. A method according to claim 3 wherein said headphone processing further 

includes adding simulated reflections and/or artificial ambience to channels applied 
to said loudspeakers with headphone and crosstalk cancelling processing. 

5. A method according to any one of claims 1-4 wherein applying sound 
25 channels to said loudspeakers without headphone and crosstalk cancelling processing 
includes encoding such sound channels to reduce the number of such sound channels 
to two when there are more than two of such sound channels. 



30 
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6. A method according to claim 5 wherein said encoding comprises matrix 
encoding. 

7. A method according to claim 6 wherein said matrix encoding is 3:2 matrix 
5 encoding. 

8. Audio apparatus for improving the spatial perception of multiple sound 
channels when reproduced by two loudspeakers, each channel representing a 
direction, comprising 

10 a processor receiving some of said sound channels and delivering two output 

signals, said channel processor including a headphone processor employing 

directional HRTFs and a crosstalk canceller, 

a first additive combiner receiving one of the outputs of said processor and 

receiving the channels other than the channels applied to said processor with relative 
15 proportions in accordance with their directions, and providing a signal for one of said 

loudspeakers, 

a second additive combiner receiving the other of the outputs of said processor 
and receiving the channels other than the channels applied to said processor with 
relative proportions in accordance with their directions, and providing a signal for the 
20 other of said loudspeakers. 

9. The apparatus of claim 8 wherein said two loudspeakers are generally front- 
located with respect to listeners and wherein sound channels representing front 
directions are coupled to the first and second additive combiners and sound channels 

25 representing directions other than front directions are coupled to said headphone 
processor. 

10. The apparatus according to claim 8 or claim 9 wherein said headphone 
processor further includes a simulated reflections and/or artificial ambience 

30 processor. 
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11. The apparatus according to any one of claims 8-10 further comprising an 
N:2 matrix encoder, wherein ones of the multiple sound channels not coupled to the 
headphone processor are coupled to said additive combiners via the N:2 encoder. 

5 

12. The apparatus according to claim 8 wherein there are five sound channels, 
L, C, R, Ls, and Rs, said processor receiving said Ls and Rs signals, said L, C, and R 
channels applied to said first and second additive combiners with relative proportions 
such that all of the L channel and none of the R channel is received by one of the 

10 combiners, all of the R channel and none of the L channel is received by the other of 
the combiners, and a substantially equal proportion of the C channel is received by 
each of the combiners. 



* 
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